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LAWS OF LARGE NUMBERS FOR EPIDEMIC MODELS WITH 
COUNTABLY MANY TYPES 
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In modeling parasitic diseases, it is natural to distinguish hosts 
according to the number of parasites that they carry, leading to a 
countably infinite type space. Proving the analogue of the determin- 
istic equations, used in models with finitely many types as a "law 
of large numbers" approximation to the underlying stochastic model, 
has previously either been done case by case, using some special struc- 
ture, or else not attempted. In this paper we prove a general theorem 
of this sort, and complement it with a rate of convergence in the 
^l-norm. 

1. Introduction. This paper is concerned with generalizations of the 
stochastic models introduced in Barbour and Kafetzaki (1993) and devel- 
oped in Luchsinger (2001a, 2001b), which describe the spread of a parasitic 
disease. With such diseases, it is natural to distinguish hosts according to 
the number of parasites that they carry. Since it is not usually possible to 
prescribe a fixed upper limit for the parasite load, this leads to models with 
countably infinitely many types, one for each possible number of parasites. 
The model considered by Kretzschmar (1993) is also of this kind, though 
framed in a deterministic, rather than a stochastic form. Then there are 
models arising in cell biology, in which, for instance, hosts may be replaced 
by cells which are distinguished according to the number of copies of a par- 
ticular gene that they carry, a number which is again, in principle, unlimited; 
see Kimmel and Axelrod (2002), Chapter 7, for a selection of branching pro- 
cess examples. The metapopulation model of Arrigoni (2003) also allows for 
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infinitely many types of patches, here distinguished by the size of the pop- 
ulation in the patch. 

The fact that there are infinitely many types can cause difficulty in prob- 
lems which, for finitely many types, would be quite standard. To take a well 
known example, in a super-critical Galton-Watson branching process with 
finitely many types, whose mean matrix is irreducible and aperiodic and 
whose offspring distributions have finite variance, the proportions of indi- 
viduals of the different types converge to fixed values if the process grows 
to infinity: a rather stronger result is to be found in Kesten and Stigum 

(1966) . If there are infinitely many types, little is generally known about the 
asymptotics of the proportions, except when the mean matrix is r-positive, 
a condition which is automatically satisfied in the finite case; here, Moy 

(1967) was able to prove convergence under a finite variance condition. For 
epidemic models analogous to those above, but with only finitely many types, 
there are typically "law of large numbers" approximations, which hold in the 
limit of large populations, and are expressed in the form of systems of dif- 
ferential equations: see, for example, Bailey (1968) or Kurtz (1980). Proving 
such limits for models with infinite numbers of types is much more deli- 
cate. Kretzschmar (1993) begins with the system of differential equations as 
the model, and so does not consider the question; in Barbour and Kafetzaki 
(1993) and Luchsinger (2001a, 2001b), the arguments are involved, and make 
use of special assumptions about the detailed form of the transition rates. 

In this paper we prove a law of large numbers approximation with explicit 
error rates in some generality. The models that we allow are constructed by 
superimposing state-dependent transitions upon a process with otherwise 
independent and well-behaved dynamics within the individuals; the state- 
dependent components are required to satisfy certain Lipschitz and growth 
conditions, to ensure that the perturbation of the underlying semi-group 
governing the independent dynamics is not too severe. The main approx- 
imation is stated in Theorem 3.1, and bounds the difference between the 
normalized process N~ 1 Xn and a deterministic trajectory x with respect 
to the £i-norm, uniformly on finite time intervals. Here, N is a "typical" pop- 
ulation size, and Xn is shorthand for {(X 3 N (t), j £ Z + ),i > 0}, where X 3 N {t) 
denotes the number of hosts at time t having j parasites. The theorem is 
sufficiently general to cover all the epidemic models mentioned above, ex- 
cept for that of Kretzschmar (1993), where only a modified version can be 
treated. 

The processes that we consider can be viewed in different ways. One is 
to interpret them as mean-field interacting particle systems. Leonard (1990) 
has used this approach to study the large population behavior of a number 
of epidemic models, though he requires the generators to be bounded, which 
is an unnatural assumption in our parasite systems. Alternatively, the value 
N^ 1 X^{t) of the normalized process at time t can be seen as a measure 
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on Z_|_, establishing connections with the theory of measure- valued processes. 
Eibeck and Wagner (2003) prove law of large numbers limit theorems in a 
rather general setting, aimed principally at coagulation and fragmentation 
models, and allowing infinite numbers of types and unbounded generators; 
however, they do not establish error bounds. 

Our argument proceeds by way of an intermediate approximation, based 
on a system Xn consisting of independent particles, which has dynamics 
reflecting the average behavior of X^. The deterministic trajectory x is 
discussed in Section 3, the approximation of N~ 1 X^ by x in Section 4, 
culminating in Theorem 4.5, and the final approximation of N~ 1 Xn by x 
in Section 5. 

2. Specifying the model. Our model is expressed in terms of a sequence 
of processes Xjy having state space X := {£ G : J2i>o€ < where Z + 
denotes the nonnegative integers and £ l the ith component of £; we also use 
e(i) to denote the ith coordinate vector. The process X^it) := (X 3 N (t) :j G 
Z + ) , t > 0, has J2j>o X 3 N (0) = N, and evolves as a pure jump Markov process 
with transition rates given by 

^ C + e(j) - e(i) at rate f {ay + a^A^)}, i > 0, j > 0, j + i; 

£^£ + e{i) at rate NpiiN- 1 ^), i>0; 

C^C- e{i) at rate + S^N' 1 ^, * > 0, 

for any £ & X, where the nonnegative quantities aij,aij,j3i,Si and Si are used 
to model different aspects of the underlying parasite life cycle. 

We interpret X l N (t) as the number of hosts who carry i parasites at time t. 
If only the constant terms in the transitions were present, parasite commu- 
nities would develop independently within different hosts, according to a 
pure jump Markov process with infinitesimal matrix a, also including the 
possibility of host death at rate <5j for hosts with i parasites. 

The remaining terms in the transitions are used to model interactions, 
and hence vary as a function of the levels x = N" 1 ^ G N~ l X of infection 
in the host population. The a™ are associated with transitions in which 
the number of parasites within a particular host changes, and can be used 
to model interactions involving both infection with new parasites and loss 
of infection through parasite death. The remaining transitions allow one to 
model state varying rates of births, deaths and immigration of hosts, in the 
latter case possibly themselves infective. 

The components of the transitions rates are required to satisfy a number 
of conditions. First, we address the o^- and <5$. Letting A denote an absorbing 
"cemetery" state, reached if a host dies, set 

(2.1) aiA ■= Si, an := -a*(i)-5~i, i > 0, 
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where we assume that a*(i) := J2j>o,j^=i ®ij < 00 • Then a is the infinitesimal 
matrix of a time homogeneous pure jump Markov process W on Z + U A. 
Writing 

(2.2) py(i) := P[W(t)=j\W(0) = i], 

for i > and j £ ZU A, we shall assume that a is such that W is nonexplosive 
and that 

(2.3) B°{(W(t) + 1)} = £(j + l) Pij (t) <(i+ l)e wt , i > 0, 

j>0 

for some w > 0, where we use the notation 

E°(/W))) := E{/(W(t))7[W(t)^A]|W(0)=i}. 
We shall further require that, for some 1 < mi,m2 < 00, 

(2.4) Q*(i)+<5 i <m 1 (i + l) m2 foralH>0, 
and also that, for each j > 0, 

(2.5) limsupajj < 00. 

Z— >oo 

The remaining elements depend on the state of the system through the ar- 
gument x := iV _1 £. In the random model, x € N _1 X has only finitely many 
nonzero elements, but when passing to a law of large numbers approxima- 
tion, this need not be appropriate in the limit. We shall instead work within 
the larger spaces 

(2.6) £ u := JxeR M :£(* + 1)1^1 < 00 1, 

{ i>0 J 

with norm ||x||n := Z)i>o(* + an d the usual l\ with norm ||x||i := 

Si>ol" c 1' We then assume that otn,(3i and <5j are all locally bounded and 
locally Lipschitz, in the following senses. First, for i > and 1,1/6 in, we 
assume that 

(2.7) «iz(0)<aoo, E (i + l)«iz(0)<(i + l)aio, 

(2.8) \ a u( x ) ~ a u(y)\ < Ooi(x,y)||a; - 1 , 

Z>0,Z^i 

(2.9) ^ (Z + l)|o«(aj)-a«(i/)| < (i + l)aii(x,7/)||x- y||n, 

Z>0,Z^i 

where the a r o are finite, 

a r i(x,y) < 5 r i(||x||u A ||y||n), r = 0, 1, 
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and the a r \ are bounded on bounded intervals. Then we assume that, for all 
x,ye in, 

(2.10) E( i + 1 )A(°)^ 6 io> 

j>0 

(2.H) ElA(a?)-A(y)|<6oi(^l/)||a:-i/||i, 

j>0 

(2.12) J2( i + 1 M( x )-^y)\<bn(x,y)\\x-y\\ u , 

i>0 

where the b r o are finite, 

b r i(x,y) <b r i(\\x\\n A \\y\\n), r = 0,l, 
and the b T \ are bounded on bounded intervals; and finally that 

(2.13) sup^(0)<4, 

(2.14) sup|^(x) -8i(y)\ < di(x,y)\\x - y\\x, 

i>0 

where do is finite and 

di(x,y) < di(\\x\\n A ||y||n), 

with d\ bounded on finite intervals. 

The various assumptions can be understood in the biological context. 
First, the two norms || • ||i and || • ||n have natural interpretations. The 
quantity ||£ — r}\\\ is the sum of the differences |£ l — r/ l \ between the numbers 
of hosts in states i = 0,1,2, . . . in two host populations £ and ij; this can 
be thought of as the natural measure of difference as seen from the hosts' 
point of view. The corresponding "parasite norm" is then ||£ — which 
weights each difference — rf\ by the factor (i + 1), the number of parasites 
plus one; in a similar way, writing x = -/V" 1 ^, one can interpret ||x||n as a 
measure of "parasite density." 

The simplest conditions are (2.7) and (2.13), which, together with condi- 
tions (2.8) and (2.14) with y = 0, ensure that the per capita rates of events 
involving infection and death are all finite, and bounded by constant multi- 
ples of the host density + 1. This is frequently biologically reasonable. 
For instance, a grazing animal, however hungry, can only consume a lim- 
ited number of mouthfuls per unit time, an infection event occurring when 
a mouthful contains infective stages of the parasite (of which there may 
be many, subject to the remaining conditions). Conditions (2.10) and (2.11) 
with y = can then be interpreted as limiting the overall immigration rate of 
hosts and the per capita host birth rate. Analogously, conditions (2.8), (2.11) 
and (2.14) for general y imply that cumulative differences in the above rates 
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between population infection states x and y are limited by multiples of the 
host norm \\x — y\\i of the difference between x and y, and also that these 
multiples remain bounded provided that the smaller of ||x||n and \\y\\n 
remains bounded. In Kretzschmar's (1993) model, and in others in which 
the per capita infection rate grows as a constant K times the parasite den- 
sity 1 1 x 1 1 11, these conditions are violated, and, without them, the coupling 
argument of Section 5 fails; however, realistic modifications of these models 
are covered; see Section 6. 

The remaining conditions concern parasite weighted analogues of the pre- 
ceding conditions. Conditions (2.10) and (2.12) with y = constrain the 
overall rate of flow of parasites into the system through immigration to 
be finite, and bounded if the parasite density remains bounded; condition 
(2.12) also limits the way in which this influx may depend on the infec- 
tion state. Conditions (2.7) and (2.9) impose analogous restrictions on the 
rates at which individual parasites can cause further infection, limiting the 
output per parasite. Bounds on the parasite weighted quantities are needed 
to establish the accuracy of approximation that we prove; see the remark 
following Lemma 4.3. Our choice of bounds allows considerable freedom. 

Note that the above discussion relates only to the state-dependent ele- 
ments in the transition rates. The conditions on the state-independent rates 
(2.3) and (2.4) have similar effects, but are less restrictive. For instance, the 
state-independent element in the death rates may increase rather rapidly 
with parasite burden (2.4), whereas the state-dependent elements are more 
strongly restricted (2.13), (2.14). The particular form of the conditions on 
the state-dependent rates is in part dictated by the coupling argument of 
Section 5, and may well not be the weakest possible for results such as ours 
to be true. Similarly, the condition (2.5) is of a purely technical nature, 
though presumably always satisfied in practice. 

Remark. The assumptions made about the otij{x) and (3{{x) have cer- 
tain general consequences. One is that the total number of hosts has to be 
finite almost surely for all t. This can be seen by comparison with a pure 
birth process, since the number of hosts ||X||i only increases through im- 
migration, and the total rate of immigration N J2i>o Pii^ -1 X) does not 

exceed Nbio + &oi(0)||^||i- Hence, for any T > 0, 
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for all i > 0. Hence, and because W is nonexplosive, it follows that, on 
the event {sup 0<t<T ||-Xjv(*)||i < NM}, the X-chain makes a.s. only finitely 
many transitions in [0, T\. Letting M — >oo, it follows from (2.16) that a.s. 
only finitely many transitions can occur in the X-chain in any finite time 
interval. 



3. The differential equations. We assume deterministic initial conditions 
Xn(0) for each iV; in fact, because of the Markov property, we could equally 
well let Xj\r(0) be measurable with respect to events up to time zero. Our 
aim is to approximate the evolution of the process N~ 1 Xj^(t) when N is 
large. A natural candidate approximation is given by the solution xn to the 
"average drift" infinite dimensional differential equation 

dx i (t) 

= x\t)a H + x l (t)a H (x(t)) - x\t) £ a u (x(t)) 

l>0 l^i tyi 

(3-1) 

+ f3 i (x(t))-x l (t)5 l (x(t)), i>0, 

with initial condition xn(0) = N~ 1 Xn(0). The following theorem, the main 
result of the paper, shows that xn indeed provides a suitable approxima- 
tion, and quantifies the ^i-error in the approximation. For convenience, we 
consider initial conditions that are close to a fixed element xq E 

Theorem 3.1. Suppose that (2.3)-(2.14) hold, and that x N (0) --N^Xm^) 
satisfies ||xjv(0) — xolln ~~ ► as N — > oo for some xo £ Let [0,t max ) de- 
note the interval on which the equation (3.1) with xq as initial condition has 
a solution x belonging to Then, for any T < i max , there exists a constant 
K(T) such that, as N — > oo, 



iV^ 1 sup \\X N (t)-Nx N (t)\\i>K(T)N- 1 / 2 log 3/2 N =0{N~ 1 / 2 ), 

0<t<T 



P 

where xn solves (3.1) with xat(0) = N~ 1 Xn(Q). 



Proving this theorem is the substance of this and the next two sections. 

First, it is by no means obvious that equation (3.1) has a solution, some- 
thing that is only proved in Theorem 4.2. And even if a solution exists, it 
is not perhaps immediate that it has to belong to the nonnegative cone, in 
contrast to the case of the stochastic model. To temporarily accommodate 
this possibility, we extend the definitions of an, Pi and <5j, setting 

au(u) = au(u + ), 5i(u) = 5i(u + ) , ueK^, 

where u\ := max(ii ! , 0), i > 0, and observing that conditions (2.8)-(2.14) are 
still satisfied, with x and y replaced by x + and y + , respectively, as arguments 
of oqi, oil) boi, bn and d±. 
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Temporarily suppressing the ^-dependence, note that equation (3.1), as 
in Arrigoni (2003), can be compactly expressed in the form 

(3.2) ^L = Ax + F(x), x(0) = N~ 1 X N (0), 
where A is a linear operator given by 

(3.3) {Ax) i = Y,x l au 1 i > 0, 

l>0 

and F is an operator given by 

(3.4) F{x) 1 = Y J x l a ll {x)-x i Y j a l i{x)+l3 l {x)-x i 5 i {x), i>0. 

If A generates a Co (strongly continuous) semigroup T(-) on a Banach 
space S, then every solution x of (3.2) with x(0) G S also satisfies the integral 
equation 

(3.5) x(t) = T(t)x(0) + [ t T(t-s)F(x(s))ds. 

Jo 

A continuous solution x of the integral equation (3.5) is called a mild solu- 
tion of the initial value problem (3.2), and if F is locally Lipschitz continuous 
and x(0) G S, then (3.5) has a unique continuous solution on some nonempty 
interval [0,t max ) [Pazy (1983), Theorem 1.4, Chapter 6]. In our case, A T is 
the infinitesimal matrix of the Markov process W, and by standard Markov 
theory [Kendall and Reuter (1957), pages 111 and 114-115], we can iden- 
tify T(t)x with P{t) T x for any i£<i, T being strongly continuous on l\. 
However, in order to establish the bounds that we have claimed, we need to 
take S to be the space We therefore show that T is also strongly con- 
tinuous in in, and that F is locally £n-Lipschitz continuous, from which 
the existence and uniqueness of a continuous solution in in to the integral 
equation (3.5) then follows. 

Lemma 3.2. T is strongly continuous in In. 

Proof. To prove this lemma, note that every sequence x such that 
||x||n < oo can be approximated in in by sequences with bounded support, 
which are all in the domain of A, so D(A) is dense in in. We now need to 
check that T(t) in C in, and that T is £n-strongly continuous at 0. First, 
for every x with ||x||n < oo, we have ||P(t) T x||n < oo for all times t from 
(2.3), since 
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(3.6) 



\p(t) T x\\ 11 = J2U + K 



i>0 



<2|x i |(t + l)e ,0 * = e ,0t ||x||ii < 



oo. 



i>0 



For strong continuity, taking any x € i\\ , we have 
||P(t) T a;-a;||ii 



j>0 I ift ) 

= j]T \x*\{B°(w(t) + 1) - (i + i)Pi[w(t) = ;]} 
+ £(7 + 1)1^1(1-^))}. 



. j>0 



Now lim t ±oPjj(t) = 1 for all j, and, by (2.3), 

< E?(W(t) + 1) - (i + l)Pi[W(t) = i] < (i + l)e™' 

and 

limsupE°(W(i) + 1) < lim(i + l)e wt = i + 1; 
40 tJ-0 

i 

liminf E?(W(t) + 1) > liminf V pa(t)(j + 1) = f + 1. 
no t|o f^, J 

Hence, since ||x||ii < oo, lim^g \\P(t) T x — x\\u = by dominated conver- 
gence. □ 

Lemma 3.3. The function F defined in (3.4) is locally Lipschitz contin- 
uous in the i\\-norm. 

Proof. For x,y > such that ||x||n, ||y||n < M, using assumptions 
(2.7)-(2.14), we have 

\\F(x)-F(y)\\u 

< + 1) £ \ xla u( x ) - y la n(y)\ 

i>0 l^i 
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where 



(3.7) 



i>0 1^4 tyi 

+ £(i + i)\/3i(x) - #(y)| + £(i + l)M0*0 - y^(y)| 



i>0 



j>0 



< W - y l \ + i)a u (x) + J2y l J2(i + l )M x ) - 

+ j2( i + 1 )\ xi -y i \J2 a u( x ) + £(* + X V X M*) - Mi/)! 

i>0 Z/j i>0 Z^i 

+ b n (x,y)\\x - y\\ n + + - y*|fft(^) 

i>0 

+XK i + 1 )f i i*i(^)-wi 

< {aio + 5n(0)||x||ii}||x - ii +an(M)||x - y\\ n \\y\\n 
+ {a o +aoi(0)}||a;||i ||x-y||n + a i(M)||y||u ||x-y||i 
+ 6n(M)||x-y||i 1 

+ {d o + di(0)}\\x\\i \\x - y\\ n + d x {M)\\x - y\\i ||y||n 

< F M ||sc - y||u, 

F A f := ai O + a u (0)M + M5u(M) 
+ «oo + aoi (0)M + M5 i (M) 
+ &n(M) + d + di(0)M + Mdi(M). 



□ 



From these two lemmas, it follows that the differential equation system 
(3.1) has a unique weak solution, so that we at least have a function xm 
to give substance to the statement of Theorem 3.1. It is later shown in 
Theorem 4.2 that, under our conditions, x^ is in fact a classical solution to 
the system (3.1). 

Our main aim is to approximate a single random process N~ 1 Xn by the 
solution xj\r of (3.5) with initial condition xjy(0) = N~ 1 Xjy(0), and to give 
an error bound that is, in principle, computable. However, in order to fix the 
definitions of the constants appearing in the error bounds, we have framed 
Theorem 3.1 in terms of a sequence of processes indexed by N, assuming that 
N~ 1 Xn(0) — ► xq in £n as N — > oo, for some xo € in- It is then natural to 
be able to approximate all of the processes N~ 1 Xn by the single solution x 
to (3.5) which has x(0) = xq. The next lemma shows that this poses no 
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problems, because the solution of (3.5) depends in locally Lipschitz fashion 
on the initial conditions, within its interval of existence. 

Lemma 3.4. Fix a solution x to the integral equation (3.5) with x(0) G 
in, an d suppose that T < i max - Then there is an e > such that, if y is a 
solution with initial condition y(0) satisfying \\y(0) — x(0)||n < e, then 

sup \\x(t) - y(t)\\ u < \\x(0) - y(0)\\ n C T , 

0<t<T 

for a constant Ct < oo. 

Proof. From the integral equation (3.5) together with (3.6), it follows 
that, if ||sc(0) - 2/(0) ||ii < e, then 

\\x(t) - y(t)\\ n < ee wt + /* F 2Mt \\x{s) - y(s)\\ne w ^ ds, 

where Fm is defined in (3.7) and 



o 



M T := sup ||x(t)||ii, 
0<t<T 

provided also that sup < t <T ||y(t)||n < 2Mt- By Gronwall's inequality, it 
then follows that 

sup \\x{t) - y(t)\\ n < \\x(0) - y(0)||nC T < eC T , 

0<t<T 

for a constant Ct < oo. This implies that sup 0<t<T ||?/(i)||ii < 2My is indeed 
satisfied if e < Mt/Ct, and the lemma follows. □ 

Corollary 3.5. Under the conditions of Theorem 3.1, if also ||xtv(0) — 
x o||n = 0(N^ 1 ^ 2 ), then xjy can be replaced by x in the statement, without 
altering the order of the approximation. 

Proof. Combine Theorem 3.1 with Lemma 3.4. □ 

It also follows from Lemma 3.4 that, if ||xjv(0) — ^ollii ~~ > 0, then for all N 
large enough 



(3.8) 



sup \\x N (t)\\ii- M T 

0<t<T 



< \\x N (0) - XqWuCt, 



provided that T < i max - In particular, if t^ax denotes the maximum time 
such that xn is uniquely defined on [0,i^( ax ), then liminfTv^oo ^max — ^max- 
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4. The independent sum approximation. The next step in proving Theo- 
rem 3.1 is to consider an approximating model Xj^(-), starting with 
-Xjv(O) = -Xjv(O), and consisting of independent individuals. Each individ- 
ual's parasite load evolves according to a time inhomogeneous Markov pro- 
cess W on Z + U A with infinitesimal matrix defined by 

qij(t) = aij + aij(t), j¥=h\ l>0, 

(4.1) »i(*) = -E«i (*)-*' 

QlA(t) = S l + 8i(t), l>0, 

where 

(4.2) aa(t):=aa{x N (t)); := 5i(x N (t)); 
and, for i, j € Z + U A, we shall write 

(4.3) Pi^t) := P[W(t)=j\W(s) = i], s<t. 

In addition, individuals may immigrate, with rates N(3i(t), where 

(4.4) k(t):=f3i{x N {t)). 

The process X^ differs from in having the nonlinear elements of the 
transition rates made linear, by replacing the Lipschitz state-dependent ele- 
ments OLij{x), Pi(x) and 5i(x) at any time t by their "typical" values aij(t), 
$i(t) and Si(t). Our strategy will be first to show that the process X^ stays 
close to the deterministic process Nxj^(t) with high probability, and then 
to show that, if this is the case, then Xn also stays close to Xn, again with 
high probability. However, we shall first use the process Xn to improve our 
knowledge about the weak solution x to (3.2). 

Let us start by introducing some further notation. Fixing T < t max , define 

(4.5) M T := M T (x) := sup V(» + l)|x^)|; 

0<*< r i>l 

(4.6) G T :=G T (x):= sup I > 

and write M$ := M T (x N ), Gj. := G T (x N ). Note that M T is finite if s(0) G 
^n, because the mild solution x is i\\ -continuous, and that > 1 and 
> 1 whenever ||xjv(0)||i = 1, as is always the case here. 
It is immediate from Lemma 3.4 that if ||xjv(0) — x(0)||n — > 0, then 
Mj? < Mt + 1 for all N large enough. Furthermore, it then follows that 

(4.7) < <M T + l 
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for all N sufficiently large. Hence, using also dominated convergence, we 
deduce that < Gt + 1 for all TV" sufficiently large. 

Our first result of the section controls the mean of the process N~ l X^ 
in the ^n-norm. 

LEMMA 4.1. Under conditions (2.3)-(2.14), for any X N (0) £ 4i and 
any T < i^ax; we have 

sup N~ 1 'S2(l + l)EX l N (t) 

o<t<T ,> 

<{^- 1 ||X 7 v(0)|| 11 +T(6 10 + 6ii(0)M^)}e( u ' +a S+ a i M T)r < oo, 
where w is as in assumption (2.3), ag := «io — «oo an d a i := ^ii(O) — 5oi(0). 

Proof. Neglecting the individuals in the cemetery state A, the pro- 
cess Xn can be represented by setting 

Xif(P) Ri(t) 

(4.8) X N (t)=J2 E e(^(t))+E £ eiW^t-r^)), 

i>0 j=l i>0 j=l 

where Wij and W-j, i > 0,j > 1, are independent copies of W, with W^ 

and W(j starting at i, and the Ty, j > 1, are the successive event times 
of independent time inhomogeneous Poisson (counting) processes Ri with 
rates N$i(t), which are also independent of all the Wij and W[p as usual, 
e(l) denotes the /th coordinate vector. Hence, it follows that, given Xn(0), 



l>0 

Y,U l N (m°{W(t) + 1} + N J' Un)^i{W{t - u) + 1} du 



i>0 1 • / ° 

where E° is as defined for (2.3). 

Now W has paths which are piecewise paths of W, but with extra killing 
because of the 5i(u) components of the rates, and with extra jumps, a-jumps, 
say, because of the 6iij{u) components. The killing we can neglect, since it 
serves only to reduce E^W(i). For the remainder, by assumption (2.8), the 
rate of occurrence is at most x := {°oo + 5oi(0)My }, irrespective of state 
and time. So, defining 

Cjj(it) := aij(u)/x, i,3 > 0, j + v, 

cu(u) :=l-^2cij(u), i>0, 

3& 
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we can construct the a-jumps by taking them to occur at the event times 
of a Poisson process R of rate x> with jump distribution for a jump at 
time u given by q.(u) if W(u— ) = i. Note that, in this case, no jump is 
realized with probability cu(u). Conditional on R having events at times 
< t\ < ■ ■ ■ < t r < t between and t, we thus have 

F${W(t)+l\h,...,tr} 

^ E PijM E c hh(tl) E Pliia(*2 -tl) E c J2/ 2 (*2) • • • 
ji>0 h>0 j 2 >0 h>0 

Plr-ljritr - t r -l) E ^rJr (*r) E^ + 1 )Plri(* ~ *»0> 
>>0 l r >0 j>0 

where, as before, Pij(t) = P[W(t) = j \ W(0) = i]. Applying assumptions 
(2.3), (2.7) and (2.9) to the last two sums, we have 

E C irlM $>' + l)pi rJ (t " tr) < E ^rfcfrOGr + l)e^) 

l r >0 j>0 l r >0 

<a^( Jr + l)e w ^\ 
where := {oio + an(0)Mjf}/x- It thus follows that 

E°{ W(t) + l\h,...,t r }< a^{W(t r ) + 1 | ti,..., ^K^). 
Arguing inductively, this implies that 

E°{W(i) + 1 | h,...,t r } <(i + l){af }V, 
and hence, unconditionally, that 

E?{W(t) + 1} < (» + l)e u,i E{(af 

(4.9) 

<(i + l)exp{( W + (af -l)x)*}. 
The remainder of the proof is immediate. □ 

Armed with this estimate, we can now proceed to identify N~ 1 ~EX]y(t) 
with xj\i(t), at the same time proving that the mild solution x n is in fact a 
classical solution to the infinite differential equation (3.1) with initial con- 
dition N- l X N (0). 

First, define the "linearized" version of (3.1): 

^ = Y,y l it)a u + J2y l (t)a H (t) - y\t) E S«(t) 

1>1 Z>0 Z>0 

(4-10) „ . _ 

+ &{t)-y t (t)6 i (t), i>0, 
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where a, (3 and 5 are as in (4.2) and (4.4), to be solved in t G [0, T] for an 
unknown function y. Clearly these equations have xn itself as a mild solution 
in in, and, by Pazy (1983), Theorem 3.2, Chapter 6, the mild solution is 
unique under our assumptions, since now 

F(t,uY := ^u% i (t)-u i ^cu(t)+fc(t)-u i 6i(t) 

l>0 l>0 
is in locally Lipschitz in u G S with constant 

«oo + o 10 + d + M$ {a i (0) + a u (0) + di (0)} 

and 

\\F(s,u) - F(t,u)\\u 

<(Mf{a 11 (M^) + 5oi(M^) + J 1 (Mf)} + 6 11 (M^))||x 7V (s)-x 7 v(t)||ii, 

whenever ||"u||n < M^Y, so that F is i-uniformly continuous on bounded 
intervals (contained in [0,t^( ax )), because xn is. We now show that y{t) = 
iV -1 EZjv(i) solves (4.10), and indeed classical solution. Since it also 
therefore solves (3.5), and since this equation has a unique solution, it follows 
that y is the same as ijv, and that it is the classical solution to equation 
(3.1) with initial condition N~ 1 X^(0). 

Theorem 4.2. Under conditions (2.3)-(2.14), for any fixed X N {Q) G 
in, the function y(t) := N~ 1 'EX^(t) satisfies the system (4-10) with initial 
condition N~ 1 Xn(0) on any interval [0,T] with T <t^ lSLX . It is hence the 
unique classical solution xn to (3.1) for this initial condition. 

Proof. Let X J N1 (t) denote the number of particles present at time 
that are still present and in state j at time t; let X J N2 (t) denote the number 
of particles that immigrated after time and are present and in state j at 
time t. Then 

EXi,(t)=EX{ 1 (t)+EX{ 2 (t) 

(4-11) 

= Y, X N (0)py (0, t) + N f Y,Pi (u,t)du, 

i>0 J ° i>0 

where pij(u,v) is as defined in (4.3). Note that the expectations are finite, 
since, from conditions (2.10) and (2.11) and by (4.7), uniformly in u G [0, T], 

(4.12) ^/3 l (n)<6 10 + 6 01 (0)G^<oo. 

i>0 
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Then, denning qji(t) as in (4.1), it follows for each j > that the quantities 
—qjj(t) are bounded, uniformly in t, since by conditions (2.13), (2.14) and 
(2.8), 

(4.13) 5j(t)<do + di(0)Gr; E«i«(*) < «oo + 6 i(0)G^, 

and since a*(j) = J2i^j ®ji is finite. Note further that, from the forward 
equations, 

—Pij(u,t) = 22pu(u,t)qij(t); 
m l>0 

see Iosifescu and Tautu (1973), Corollary to Theorem 2.3.8, page 214. 
Now we have 

i>0 

(4.14) =X' N (0) + Y / X i N (0) fY^PilMqiji^du 

i>0 J ° Z>0 

t 



l>0 







with no problems about reordering, because of the uniform boundedness 
discussed above, and since only one of the qij is negative. Then also, with 
rearrangements similarly justified, we define 



Qt := N f \ f V/3j(u) y2pu(u,v)qij(v)du \ dv; 
Jo l> l >0 l>0 ) 

taking the i-sum first, and then the u-integral, we obtain 

Qt = [ J2 EX N2(v)qij(v)dv; 
Jo l>0 

taking the £-sum first, we have 

rt f rt Q 



Qt = N |^ ^h{u)^Pij{u,v)l [a ^{u)du^ dv 

= N [ y2f3i(u){pij(u,t) - pij(u,u)} du 

= BX J N Jt)-N f 0Au)du. 
Jo 
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From these two representations of Qt, it follows that 

(4.15) BX 3 N2 (t) =J^N0j(u) +jyX l m (u) qij ( U )^ du; 

combining (4.15) and (4.15), we thus have 

(4-16) 

= N^X^O) + J^k( u ) + J2 N ~ lE X l N (u) qij (u)^ du. 

Since the right-hand side is an indefinite integral up to t, it follows that 
iV _1 EX^(t) is continuous in t, for each j. The quantities qji(t) are all con- 
tinuous, because xn is in -continuous in t and the ctu{x) and 5i{x) are 
Lipschitz, and also, for qu(t), from assumption (2.8). Then, from Lemma 4.1, 
we also have 

J2 E4W < ( J + l) _1 {|l^(0)||ii + NT(b w + ~b 11 (0)M^)}e^ +a o+ a i M ^ T , 
j>J 

so that, in view of assumption (2.5) and of (4.13), the sum on the right-hand 
side of (4.16) is uniformly convergent, and its sum continuous. Hence (4.16) 
can be differentiated with respect to t to recover the system (4.10), proving 
the theorem. □ 

Our next result shows that, under appropriate conditions, N~ 1 Xj^(t) 
and x(t) are close in ^i-norm at any fixed t, with very high probability. 

Lemma 4.3. Suppose that conditions (2.3)-(2.14) are satisfied, that 
A"j\r(0) £ in an d that N > 9. Then, for any t 6 [0, T] with T < £^ ax , 



B\\X N (t) - Nx N (t)\\i < 3(Mf + l)VJVlogJV. 

Furthermore, for any r > 0, there exist constants Kr > 1 and Kr such 
that 

PDI-M*) - Nx N {t)\\i > kV>{M$ + lJJV^log 3 / 2 N] < kPg^n-*. 

Proof. For a sum W of independent indicator random variables with 
mean M, and for any 5 > 0, it follows from the Chernoff inequalities that 

(4.17) max{P[W > M (1 + 8)],P\W < M(l - 5)}} < exp{-M<5 2 /(2 + 5)}; 

see Chung and Lu (2006), Theorem 4. Now the quantity X~L(t) can be ex- 
pressed as a sum of independent random variables , . . . , Yjy and V', where 
Y£ is the indicator of the event that the kih initial individual is in state j at 
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time t, and Y' is a Poisson random variable with mean N L >~2i>o Pi ( s )Pij ( s > t) ds. 
Hence, by the simple observation that Yax(X N (t)) < ~E(X N (t)) = Nx l N (t), 
we have 

(4.18) E|XV(t) - Nx N (t)\ < y/Nx%(t) A {2Nx N (t)}, 
and, by (4.17), for any a > 2 and N > 3, we have 

P[|*M*) - Nx%(t)\ > a^Nx N (t)logN] < 2N~ a '\ 

so long as Nx l N {t) > 1. By Lemma A.l(iii), in view of (4.18), it now follows 
immediately that 

~E\\X N (t) - Nx N (t)\\i < 3(-Mt + iWNlogN, 

giving the first statement. 

For the second, let In if) '■= {i :x n(^) — V-^}- Then it is immediate that 
|/jv(*)| <NG$, so that, if 

B N (t) := p| {\X N (t) - Nx N (t)\ < a^Nx N (t)logN}, 

i£l N (i) 

then 

(4.19) V[B N (t)} < 2G$N 1 ~ a / 2 . 

On the other hand, on the event Bj\r(t), it follows from Lemma A.l (i) that 



Y \X N (t) - Nx N (t)\ < a logiV Y, \JNx N (t) 
iei N (t) iei(t) 

(4.20) 



<alogiV(Mf +l)yWk^N. 

For the remaining indices, we note that SW(*) := 2~2igi N (t) X N {t) is also a 
sum of many independent indicator random variables plus an independent 
Poisson component. Using (4.17), we thus have 

p s N (t)> Y a^5v(*) + ^ 1/2 (^t +i)Viogiv 

(4.21) 

< exp{-N 1/2 (Mfi + 1) v^ogiV/3} < exp{-Af 1/2 /3}, 

since 5 := N l l 2 (M^ + l)y/TogN/ T,i$i N (t) ^jv(*) > 1 from Lemma A.l(ii); 
otherwise, we have 

(4.22) Y X l N (t)<2N 1 / 2 {M^ + l)^g~N, 

i£/jv(*) 

again from Lemma A.l (ii). Now, fixing any r > and taking a = 2(r + 1), 
the second part of the lemma follows from (4.19)-(4.22). □ 
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Remark. The argument above makes essential use of the finiteness 
of MJp, through Lemma A.l. For < oo, we needed the solution xjy to 
(3.5) to be ^n-continuous, and hence needed to consider equation (3.5) with 
respect to Now, for the accuracy of approximation given in Lemma 4.3, 
the condition < oo is in fact close to being necessary. To see this, note 
that, if Zi ~ Vo{Npi) for i > 1, for some choice of pi, then 

J2 E|Z 4 -iV Pi | ^ N 1 ' 2 J2 VPl- 

i: Pi >l/N i: Pi >l/N 

In the arguments above, we have the x l N (t) for the pi, with the X' l N (t), which 
are close to being Poisson distributed, in place of Zi, and we use the fact 
that J2i>iiPi < 00 • Suppose instead that pi ~ cz _1 ~ r? for some c > and 
< r] < I, so that J2i>i i n Pi = 00 • Then it follows that 

£ X ^(1-^/2(1+.) ; 
v. P i>X/N 

and hence, that 

^Tei^-a^i > ^1/2^(1-^/2(1+^) 

j>l 

Thus, for such pi, an approximation as close as that of Lemma 4.3 cannot 
be attained, because the mean £i-distance would be at least of order as big 
as iV 7 for 7 = 1/(1 +77) > 1/2. Note that such circumstances would arise in 
our model, if we took, for instance, ce^j-i = i, i>2, 5\ = 1, (3i(x) = 
for all x, and set all other elements of the transition rates equal to zero. The 
resulting stochastic model has no interactions, so that the processes Xn 
and Xjy have the same distribution, that of a particular multitype Markov 
immigration-death process. At stationarity, the mean number pi of individ- 
uals of type i satisfies pi ~ ci~ 1 ~ v as i — ► 00, for c = I/77. Of course, with 
this choice, the $(0) violate condition (2.10). 

The next lemma is used to control the fluctuations of Xn between close 
time points. We define the quantity 

H$ := 2 m *- 1 ?n 1 + {&i + a 00 + 601 (0) 

+ d + G^(aoi(0)+d 1 (0))}/\NM^r 2 -\ 

which will be used as part of an upper bound for the transition rates of the 
process Xjy on [0, T] , noting that 

(4.23) 1 < < H*Mt, 

where H* does not depend on T or N. 
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Lemma 4.4. Suppose that conditions (2.3)-(2.14) are satisfied. Then, if 
h<l/(2\NMJV] m2 Hfi),t<T-h, andifWXN^-NxN^WiKKN^log 3 / 2 
it follows that 



sup \\X N (t + u) - X N {t)\\i > KN 1/2 log 3/2 N + alogN 

0<u<h 



P 

for any a > 2 and N > 3. 



Proof. At time t, there are ||X/v(t)||i individuals in the system, each 
of which evolves independently of the others over the interval s£ [t,t + h]; 
in addition, new immigrants may arrive. During the interval [t,t + h], an 
individual in state i > at time t has probability 

exp< — lh(a*(i) + Si) + / Si(x(t + u)) du + I Y ctu(x(t + u)) du) > 

IV Jo Jo & J) 

of not changing state; and the expected number of immigrants is Poisson 
distributed with mean 

rh 

I du. 



NY, I Pi(x(t + u))> 



Now consider the total number R(t, h) of individuals that either change 
state or immigrate during the interval [t,t + h]. For each i > 0, the individ- 
uals in state i at time t can be split into two groups, the first containing 
X l N (t) A Nx l N (t) randomly chosen individuals, and the second containing 
the remainder. Adding over i, the numbers in the second group add up 
to at most KN 1 / 2 log 3 / 2 N, by assumption. Then, from the observations 
above, the mean number of individuals in the first group that change state 
in [t, t + h] is at most 



(4.24) 



2\NM*T\-1 

Y hNx^mxii + l)^ + Y, Nx N(t) 

i=0 i>2\NM^] 

+ YhNx%(t){d + di(0)G^ + a 00 + a i(0)G^} 

i>0 



from (2.4) and (2.7)— (2.14). Finally, the expected number of immigrants in 
[t, t + h] is at most 

(4.25) hN{b 10 + boi(0)G$}. 

Adding (4.24) and (4.25), and recalling the assumption on h, we obtain an 
expected number of events in these categories of at most 

hNM^mx{2\NM^ l}™ 2 " 1 + \ 
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(4.26) + hN{b w + G$(d + aoo + 601 (0)) + [G£]Vi(0) + 5oi(0))} 
<i + h\NMF-\ ma H$<l. 

Applying the Chernoff bounds (4.17), the probability that more than alogN > 
2 events of these kinds occur in [t, t + h] is thus at most A~ a / 6 , implying in 
sum that 

P[R(t,h) > K N 1 / 2 log 3 / 2 N + a log A] < N~ a / 6 . 
Since sup < u </i ||Ajv(i + u) — X^-(t)\\i < R(t, h), the lemma follows. □ 

We are now in a position to prove the main result of the section, showing 
that the independent sum process N~ 1 Xn is a good approximation to xjy, 
uniformly in [0, T]. 



Theorem 4.5. Under conditions (2.3)-(2.14), and for T <t^ ia _ yi , there 

;ist co 
enough, 



exist constants 1 < , Kr^ < oo for each r > such that, for N large 



sup WN^Xtftf) -sjv(t)||i > (M? + F m n)N~ 1 / 2 log 3 / 2 A 

0<t<T T 



<K^(M^) m2+2 N- r , 
where Fm is as in (3.7). 

PROOF. Suppose that N > 9. Divide the interval [0, T] into \2T ] ™ 2 

H$] intervals [ti,t I+1 ] of lengths hi = t i+1 -ti < l/(2\NM^] m2 H$). Apply 
Lemma 4.3 with r + m 2 for r and with t = t\ for each /, and apply Lemma 4.4 
with a = 6(r + m^) an d with t = t\ and h = hi for each I; except on a set of 
probability at most 

\2T\NM$] m2 H$] {K {2 1 G%N~ r - m2 + A" r - m2 ), 



we have 



sup \\N X]Sf(t)—XN 
0<t<T 

^(1) 



(4.27) < 2K% rr ^(M$ + l)iV~ 1/2 log 3 / 2 N + 6(r + m 2 )N~ 1 log A" 



+ sup ||xjv(s) — Xn 

0<s,t<T;\s-t\<l/(2\NM^] m2 H$) 

Now, since x^ satisfies (3.5), it follows that, for < u < h, 
\\x N (t + u) -XN{t)\\i < \\x N (t)P(u) -x N (t)\\i 

+ rWFixNit + v^Pih-v^dv. 
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By (3.6), we have 

\\F(x N (t + v))P{h - t,)|| u < e wu \\F{x N {t + t,))|| u < e wh F m n , 
the last inequality following from Lemma 3.3, so that therefore 



\\F{x N {t + v))P{h - v^dv <he wh M$ F m n < 



wT 77 



' j t - 2N 

for h < l/{2\NM^] m2 H^}, since m 2 > 1 and H$ > 1. Then 
Haw (t)P(u) - xjv (t) || i < 2 £ |4, (t) | (1 - Pjj (u) ) , 

j>0 

and, by part of the calculation in (4.26), we find that 

J2W N (t)\(l- Pjj (u))<u £ w N (t)\miU + ir 2 + E i4(*)i 

i>o i=o j>2\NM*-\ 

<hm 1 (2\NM^] m2 - 1 )M^ + ^, 

which is at most 3/(2iV) if ft < l/(2\ NM$ ] m2 H^). 

Hence, the third term in (4.27) is of order (1 + F m n)N~ 1 under the 
conditions of the theorem, and the result follows. □ 



5. The main approximation. We now turn to estimating the deviations 
of the process from the actual process Xn of interest. We do so by 
coupling the processes in such a way that the "distance" between them 
cannot increase too much over any finite time interval. In our coupling, we 
pair each individual in state i > 1 in Xn(0) with a corresponding individual 
in state i in X^(0) so that all their ^t- and J-transitions are identical. This 
process entails an implicit labeling, which we suppress from the notation. 
Now the remaining transitions have rates which are not quite the same in 
the two processes, and hence, the two can gradually drift apart. Our strategy 
is to make their transitions identical as far as we can, but, once a transition 
in one process is not matched in the other, the individuals are decoupled 
thereafter. For our purposes, it is simply enough to show that the number 
of decoupled pairs is sufficiently small; what pairs of states these individuals 
occupy is immaterial. 

We realize the coupling between Xn and Xn in terms of a four component 
process Z(-) with 

Z(t) = ((Zi(t), i > 0, 1 < I < 3), Z A (t)) G X 3 x Z+, 

constructed in such a way that we can define Xjy(-) = Z\{-) + ^(O and 
X N (.) = Zi(-) + Z 3 (-), and starting with Zi(0) = X N (0) = X N (0), 
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Z 2 (0) = Z 3 (0) = G X , and Z 4 (0) = 0. The component Z 4 is used only to 
keep count of certain uncoupled individuals: either unmatched ^-immigrants, 
or Z 3 individuals that die, or Z2 individuals created at the death of (one 
member of) a coupled pair. The transition rates of Z are given as follows, 
using the notation ei(i) for the ith coordinate vector in the Zth copy of X, 
and writing X = Z\ + Z 2 . For the a- and a-transitions, at time t and for 
any i 7^ I, we have 

Z -> Z + (ei(0 - ei(i)) at rate Z{{a u + {a^N^X) A a u (x N (t)))}; 

Z Z + (e 2 (Z)+e 3 (i)-ei(i)) at rate ^{^(JV^X) - a«(xjv(t))} + ; 

Z Z + (e 2 (i)+e 3 (/)-ei(i))atrate^{ai i (A r ~ 1 X)-a ii (^(t))} _ ; 

Z -»■ Z + (e 2 (0 - e 2 (i)) at rate Z l 2 {a u + o« (iV -1 *)}; 

Z ->■ Z + (e 3 (Z) - e 3 (i)) at rate Z^oy + ay (zjv (*))}, 

with possibilities for individuals in the two processes to become uncoupled, 
when N~ l X 7^ x(t). For the immigration transitions, we have 

Z -» Z + ei(t) at rate N^N^X) A fc(x N (t))}, i > 0; 

Z -» Z + e 2 (i) +e 4 at rate N^N^X) - (3i(x N (t))} + , i>0; 
Z Z + e- i {i)&ti&teN{(3 i {N~ 1 X)-(3 i {x N {t))Y, i > 0, 



with some immigrations not being precisely matched; the second transition 
includes an e 4 to ensure that each individual in Z 2 has a counterpart in 
either Z 3 or Z4. For the deaths, we have 



z - 


-> z- 


ei(z) at rate Z{{^ + (^(JV -1 ^ A ^(^(i)))}, 


i>0; 


z - 


-> Z- 


e x {i) + e 3 (t) at rate Z^N^X) - Si(x N (t))}+, 


i > 0: 


z - 


-> z- 


ei(i) + e 2 (i) +e 4 at rate Zj-^iV^X) - 5i(xjv(t))}~ , 


i > 


z - 


-> z- 


e 2 (i) at rate Z|{^ + ^(iV" 1 *)}, 


i > 0; 


z - 


■» z- 


e 3 (i) + e 4 at rate Z|{5j + Si (x n (*))}, 


i>0, 



where Z 4 (-) is also used to count the deaths of uncoupled Z 3 -individuals, 
and uncoupled deaths in Xn of coupled Z\ individuals. With this joint 
construction, we have arranged that 



(5.1) 



]rzi(i)<z 4 (i) + £zl(i) 



i>0 i>0 



for all t, and that 



(5.2) 



V N (t) := Z 4 (t) + $>3(*) 



i>0 
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is a counting process. We allow unmatched deaths in the ^-process. We 
thus have the bound 

\\X N (t) - X N (t)\\i = \\(Zi(t) + Z 2 (t)) - (Z^t) + Za(t))\\i 
(5.3) <Y,{Z\(t) + Z\{t)} 

i>0 



<2lz 4 (t) + J2Z 3 (t)\=2V N (t), 

{ i>0 J 



for all t, by (5.1). 

Now Vn has a compensator An with intensity ajv, satisfying 

aN(t) = £ ZOt^lauiN^XNit)) - a a {x N {t))\ 

i>0 l>0 



+ Nj2\Pi(N- 1 X N (t)) - Pi(x N (t))\ 

i>0 

+ Zi(t)\6 i (N- 1 X N (t)) - 5i(x N (t))\ 

i>0 

<E^)EM^ _1 ^W)-«^(t))i 

i>0 l>0 

+ NY,\Pi(N- 1 X N (t)) - pi(x N (t))\ 

i>0 

+ Y.^N^MN-'XNit)) - 5i(x N (t))\. 

i>0 

Now, condition (2.8) implies that, uniformly in i, 

lauiN^XNit)) - a u (x N (t))\ < a i ( (0 II n ) ll^^^iv (*) - x N 

l>0 

Hence, 

N^a^t) < (^xV(t)5 i(M^) + 6oi(M^) + ^xV(tK(M^)) 

\i>0 i>0 ) 



1- 



x \\N- x X N {t)-x N {t)\\ x 
+ 1 1 N~ 1 X N (t ) - x N (t ) 1 1 1 (ooi (M^Y ) + ~d x (M^Y ) ) 
x WN^Xn^-xn^IU 
< {H^+H^WN-'Xxit) - XNWWdWN^XNit) - x N (t)\\u 
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where 

H$' N) =G$a 01 (MF) + b 01 (MF) + G$d 1 {MF); 

Hp N) = d 01 (M^ ) + dx{M$). 
In particular, denning 

t n := mfityo-.WN^XNit) - XN^yi}, 

it follows that 

iV _1 ajv(t A tat) 

(5.4) 

< {H$' N) + H^'yWN^XNit A r N ) - x N (t A rjv)||i. 

For the next two lemmas, we shall restrict the range of N in a way that is 
asymptotically unimportant. We shall suppose that N satisfies the inequal- 
ities 

(i< + F m n)N~ 1 / 2 log 3 / 2 N<1; 

(5.5) 

N > max{(^ 3 )) 1/(m2+2) M^,9}, 
where ro = rri2 + 2, and the quantities K^ 1 and Kr are as for Theorem 4.5. 

Lemma 5.1. Under conditions (2.3)-(2.14), for any te [0,T] and for 
all N satisfying (5.5), we have 

N-^\\X N (t A t n ) - X N (t A ty)||i 

Proof. Write Mn(-) '■= Vjy(-) - A N (-). Then, because also 
WN^XnW-xnWWi. 

(5-6) 

< {{N^X^t) - TV-^jv^lll + " x N (t)\\ u 

and using (5.4), we have 
(2N)- 1 \\X N (t A rjv) - X N (t A 7*)||i 

< iV - x V{t A riv) 

< N~ x M.^(t A tjv) 

+ r TN {Hp N) + Hp N) }{N~ l \\X N {s A tat) - Xjv(* A T^Hi 
(5-7) 7 ° 

+ ||iV l X N (s At n ) - x N (s AT N )\\i}ds 
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+ f{H^ N) + Hp N) }{N~ l \\X N {s A r N ) - X N (s A t*)||i 
Jo 

+ \\N~ 1 X N (s A tat) -x n (s Ar N )\\i}ds. 
Now M.n(- Attv) is a martingale, since, by (5.4), 

E|y ajv(s)dsj 

< {H£' N) +Hp N) }Ey* A ™ \\X N (s)-Nx N (s)\\ 1 ds^ 

<{H^ N) +Hp N) } ( t {Y.\\X N (s)\\x+N\\x N {a)\\ x }d a < oc, 
Jo 

this last using (2.15). Taking expectations, it thus follows that 
(2N)- 1 B\\X N (t A t n ) - X N (t A TAr)[|a 

(5.8) < /'VT^ ) +^T' JV) }{^ 1 E||X J v( S Ar J v)-X iV r( S Ar J v)||i 

JO 

+ E||iV~ 1 XAr(s At n ) - x N (s ArAr)||i}ds. 

Now we have 

E||iV- 1 X i v(s A rjy) - x N (s A rjv)||i 

= ElHiV-^jvOO - Xiv(s)||i/[rjv > a]} 

(5.9) + ElUiV-^jvM - a;* far)||i/fav < s]} 

< EUAT-^jvCs) - xjvWIIi + (1 + W)Pfav < a] 

< EUAT-^jvCs) - xjv(a)||i + (1 + l/iV)P[rjv < T]. 

For iV satisfying (5.5), the first term in (5.9) is bounded by 3(M^ + l)^ 1 / 2 > 
ydogiV by Lemma 4.3. Also, for such N, the event {tn < T} lies in the ex- 
ceptional set for Theorem 4.5 with r = vq, implying that 

(5.10) P[t n <T}< K${MF) m2+2 N- ro , 

so that the second term is no larger than M$ N'^^/logN if 

N > max{(K( 3 )(Aff) m2+1 ) 1/(n, " 1/2) ,9}, 
which is also true if (5.5) is satisfied. This implies that, for such N, 

(5.11) E||iV~ 1 Xjv(s Ar^v) -^(sAr^v)!!! < 7M^A^ 1/2 VlogiV. 

Using (5.11) in (5.8) and applying Gronwall's inequality, the lemma follows. 

□ 
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Lemma 5.2. Under conditions (2.3)-(2.14)> f or an V t e [0>^1 an d y>0, 
and for all N satisfying (5.5), we have 



sup |Ar\Mjv(sATiv)|>y <g{t{Hp N) +Hp N) ))M$y- 2 N-V 2 ^N, 

0<s<t 

where g(x) := 7xe 2x . 



Proof. Since Vjv(-) * s a counting process with continuous compen- 
sator An, we have from (5.4) and (5.6) that 

BM 2 N (t A t n ) = EA N (t A t n ) 

< {H^ N) + Hp N) } (\n\XN{s A t n ) -X N (sA t n )\\i 
Jo 

+ B\\X N (s At n ) - Nx N (s AT N )\\i}ds. 

The first expectation is bounded using Lemma 5.1, the second from (5.11), 
from which it follows that 

BM 2 N (t A t n ) < g{t{Hp N) + H (2 ' N) )) M$NV 2 y/]ogN. 

The lemma now follows from the Levy-Kolmogorov inequality. □ 

We are finally in a position to complete the proof of Theorem 3.1. 



Proof of Theorem 3.1. Suppose that N satisfies (5.5). Returning to 
the inequality (5.7), we can now write 

(2N)- 1 sup \\X N (sAT N )-X N (sAT N )\\i<N- l V(tAT N ) 

0<s<t 

< N^Mn^Atn) 

+ f{H^ N) + H^UN' 1 \\X N (s A t n ) - X N (s A t n )\\i 
Jo 

+ \\N~ 1 X N (s At n ) - x N (s AT N )\\i}ds. 

From Lemma 5.2, taking y = yN = Mjl A r_1//2 v / log N, we can bound the 
martingale contribution uniformly on [0,T] by Mjf N^^y/log N, except on 
an event of probability at most 

9(T(H£' N) + Hp N) ))N- 1 ' 2 . 
By Theorem 4.5, for any r > 0, we can find a constant K r such that 
sup \\ir x X N (t) -x N (t)\\i < K r (M$ + F m n)N~ 1/2 log 3/2 N, 

0<t<T T 
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except on an event of probability 0{{M^) m2+2 N r ). Hence, once again by 
Gronwall's inequality, it follows that, except on these exceptional events, 

iV" 1 SUp \\X N (s /\Tn) - X N {jB t\T N )\\ X 
0<s<t 

< 2jv -l/2 e 2t(4 1 ^+4 2 ^)) 

x {M^VbgAf + T(H^ N) + Hp N) )K r {M$ + F m n ) log 3 / 2 N}. 

Combining this with Theorem 4.5, and since also, by (5.10), ~P[tn <T] = 
0((M^) m2+2 N~ r ) for any r, the theorem follows: 

N- 1 sup \\X N (t) - Nx N {t)\\ 1 > K(T)N~ 1/2 log i/2 N = 0(N- 1/2 ). 

0<t<T 

Note that the inequalities (5.5) are satisfied for all N sufficiently large, 
and that the constant K{T) and the implied constant in 0(N~ 1 ^ 2 ) can 
be chosen uniformly in N, because, under the conditions of the theorem, 
||^Af(0) — x(0)||n — > as N — > oo, with the result that, for all large enough N, 
Gj: and Mjf can be replaced by Gt + 1 and Mr + 1 respectively, with the 
corresponding modifications in H^' N ' , H^' N ^ and F m n. □ 

6. Examples. In this section we show that the assumptions that we have 
made are satisfied by a number of epidemic models. These include the models 
introduced in Barbour and Kafetzaki (1993) and in Barbour (1994), both of 
which were generalized and studied in depth in Luchsinger (1999, 2001a, 
2001b). 



6.1. Luchsinger's nonlinear model. In Luchsinger's nonlinear model, the 
total population size is fixed at N at all times, with j3i(x) = 5i(x) =5i = for 
all i > and x E t\\. The matrix a is the superposition of the infinitesimal 
matrices of a linear pure death process with rate \x > and of a catastrophe 
process which jumps from any state to at constant rate k > 0. The first 
of the above expresses the assumption that parasites die independently at 
rate \i. The second corresponds to the fact that hosts die independently at 
rate k, and their parasites with them; whenever a host dies, it is instantly 
replaced by a healthy individual. Thus, the positive elements of a are given 
by 

ai s i-i = i(i, a i0 = H;, i>2; <5 10 = /i + re, 

and an, i > 1, is determined by (2.1). The elements otoj are all zero. It is easy 
to check that assumptions (2.3)-(2.5) are satisfied, with w = 0, mi = fi + re 
and ??i2 = 1- 
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As regards infection, hosts make potentially infectious contacts at rate 
A > 0, and infection can only occur in a currently uninfected host. If a 
host carrying i parasites contacts a healthy one, infection with I parasites is 
developed by the healthy host with probability pn , where J2i>oPu = 1 f° r au * 
and pqq = 1. Here, the distribution Fi = (pu, I > 0) is the 2-fold convolution 
of Fi, modeling the assumption that, at such a contact, the parasites act 
independently in transmitting offspring to the previously healthy host. These 
rules are incorporated by taking 

i>\ 

and the remaining aa(x) are all zero. Thus, for z > 0, we can take aoo = 
aio = and 

a-oi(z) = \, an(z) = Amax{0, 1}, 

where 9 is the mean of F±, the mean number of offspring transmitted by a 
parasite during an infectious contact: thus, J2i>oPu(l + 1) = i0 + 1. 

6.2. Luchsinger's linear model. In Luchsinger's linear model there is tac- 
itly assumed to be an infinite pool of potential infectives, so that the 0- 
coordinate is not required, and its value may if desired be set to 0; the 
population of interest consists of the infected hosts, whose number may 
vary. The matrix a is the infinitesimal matrix of a simple death process 
with rate (J, > 0, but now restricted to the reduced state space, giving the 
positive elements 

a,i^i = ipL, i>2; 

hosts losing infection are now incorporated by using the <5$, with 

Si = K, i>2; 5± = K + fi, 

again with oca, i>2, determined by (2.1). Assumptions (2.3)— (2.5) are again 
easily satisfied. Only a member of the pool of uninfected individuals can be 
infected, and infections with i parasites occur at a rate XJ2i>i X l pu, so that 
we have 

(3i(x) = \^Tx l pu, i>l, 
l>i 

with all the au{x) and 5i(x) equal to zero. Here, for z > 0, we can take 
bio = and 



boi(z) = X, bu(z) = Amax{#, 1}. 
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6.3. Kretzschmar's model. In Kretzschmar's model mortality of para- 
sites is modeled as in Luchsinger's nonlinear model. In addition, hosts can 
both die and give birth, with death rates increasing with parasite load, and 
birth rates decreasing. In our formulation, the positive elements of a are 
given by 

ai t i-i = ifi, i>l, 

and we take 

5i = K + ia, Si(x) = 0, 

for nonnegative constants k and a; again, the an are determined by (2.1). 
Assumptions (2.3)-(2.5) are easily satisfied. Hosts are born free of parasites, 
so that Pi(x) = for i > 1, and 

/3o(x)=/?5>T, 

for some /3 > and < £ < 1. Here, we can take bio = and, for z > 0, 
b 01 (z) = bu(z) = (3. 

Infection only takes place a single parasite at a time, but at a complicated 
state dependent rate. The au(x) are all zero except for I = i + 1, when 

a i>i+1 (x) = Xtp(x), i>0, with ip{x) = ^jx 3 j I c + ^V \. 

j>i l i>o J 

Unfortunately, for c > 0, we cannot improve substantially on the bound 

\ip(x) - tp(y)\ < c _1 ||x- y||ii +c" 2 (||j;||ii A ||y||n)||x — 2/ 1 1 1 , 

which does not yield a suitable bound for aoi(x,y), because of the appear- 
ance of \\x — 2/ 1| ii • However, the same average rate of parasite transmission 
is obtained if we define 

(6.1) a i>i+j (x) = \xi/lc + J2x l \, j>l, «>0, 

I i>o J 

or, more generally, much as in Luchsinger's models, 

(6.2) a iji+j (x) = uY,x l Pij/\c + Y, xl \i J>1, *>0, 

l>i { i>o ) 

where J2j>i3Pij = ^ f° r each I > 1, and vQ = A. With these rates, for c> 0, 
our conditions are satisfied with 

«oo = «io = 0, 
aoi(x,y) = 2i/c _1 , 

an(x,y) = 2A{c~ 1 + C - 2 (||j;||ii A ||y||n)}. 
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In Kretzschmar's model parasites are assumed to be ingested singly, but 
potentially arbitrarily fast. In our variant there is a fixed rate v of taking 
mouthfuls. Each mouthful leads to the ingestion of j parasites with proba- 
bility related to the relative frequencies of /-hosts, I > 1, in the population, 
and to the chance pij that an /-host transmits j parasites in the mouthful. 
At least for grazing animals, this would seem to be a more plausible de- 
scription of what might happen. Kretzschmar's model can, for instance, be 
interpreted as the limit of this model as v — > oo, with 9 = \ jv for fixed A 
and {pij, j > 1} the probabilities from the Poisson distribution Po(W). Note 
that, as is to be expected, the value of ao\(x, y) tends to infinity in this limit. 

APPENDIX 

In this section we prove a lemma used in the proof of Lemma 4.3. 

Lemma A.l. Suppose that Ui>0 for % > 0, and that J2i>o(i + ^) u i — 
M < oo for some M > 1. Let I N := {i > : m > 1/N}. Then, for any N>9, 
we have the following: 

(i) J2V^H<(M + l)V^gN, 

(ii) U i ^ ( M + l ) N ~ 1 ' 2 V^gN, 



(iii) ]r{v/A^A (2N Ui )} < 3(M + l)y/N\ogN. 

Proof. Define Jn ■= {i > 0: (i + l)y/m\ogN > 1} and K N := {i: 
< i < y/~N } . Then we have 

E\/^ = E V^i + E V^i 



*—L (i + l)vlogiV fri 



< (M + l)VlogiV. 
A similar calculation then gives 

E Ui = E Ui + E Ui 

i0N i&K N \I N i£(I N UK N ) 



<Y, N ~ 1 + E N- l ' 2 i Ul 

i= i>\_-/N\ 

< N~ 1/2 + iV" 1 + MN' 1/2 < N' 1/2 (M + l)^/^ogN 
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in N > 9. Part (iii) combines the two results. □ 
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